Experimental Investigating the F-measure as Similarity Measure for Automatic Text Summarization
Abstract
This paper evaluates the performance of different similarity measures in the context of document summarization. For this purpose, a simple and effective sentence-extractive technique is used. The proposed method is based on evaluating the relevance score of each sentence. Many measures are available for calculating inter-sentence relationships. To calculate the similarity between sentences, we use the cosine measure and the classical IR F-measure. We present a comprehensive experimental evaluation on two different document collections. Our experimental results show that the F-measure leads to better overall results than the cosine measure.
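The two similarity measures named in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's formulas, so everything here is an assumption: whitespace tokenization, raw term-frequency vectors for the cosine measure, an F-measure computed from precision and recall over the two sentences' term sets, and a relevance score defined as a sentence's mean similarity to the other sentences. The function names (`tokenize`, `cosine_similarity`, `f_measure_similarity`, `relevance_scores`, `summarize`) are hypothetical and chosen only for illustration.

```python
import math
from collections import Counter


def tokenize(sentence):
    """Lowercase whitespace tokenization (a simplifying assumption)."""
    return sentence.lower().split()


def cosine_similarity(s1, s2):
    """Cosine of the angle between raw term-frequency vectors."""
    a, b = Counter(tokenize(s1)), Counter(tokenize(s2))
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0


def f_measure_similarity(s1, s2):
    """Harmonic mean of precision and recall over the two term sets.

    Assumed reading: precision = |A & B| / |A|, recall = |A & B| / |B|.
    """
    a, b = set(tokenize(s1)), set(tokenize(s2))
    overlap = len(a & b)
    if overlap == 0:
        return 0.0
    precision = overlap / len(a)
    recall = overlap / len(b)
    return 2 * precision * recall / (precision + recall)


def relevance_scores(sentences, similarity):
    """Score each sentence by its mean similarity to all other sentences
    (an assumed definition of relevance, not taken from the paper)."""
    scores = []
    for i, s in enumerate(sentences):
        others = [similarity(s, t) for j, t in enumerate(sentences) if j != i]
        scores.append(sum(others) / len(others) if others else 0.0)
    return scores


def summarize(sentences, similarity, k=1):
    """Return the k highest-scoring sentences in their original order."""
    scores = relevance_scores(sentences, similarity)
    top = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)[:k]
    return [sentences[i] for i in sorted(top)]
```

Passing either `cosine_similarity` or `f_measure_similarity` as the `similarity` argument of `summarize` lets the two measures be compared on the same sentences, which mirrors the comparison the abstract describes in spirit only.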
Similar resources
A survey on Automatic Text Summarization
Text summarization endeavors to produce a summary version of a text, while maintaining the original ideas. The textual content on the web, in particular, is growing at an exponential rate. The ability to decipher through such massive amount of data, in order to extract the useful information, is a major undertaking and requires an automatic mechanism to aid with the extant repository of informa...
Leveraging word embeddings for spoken document summarization
Owing to the rapidly growing multimedia content available on the Internet, extractive spoken document summarization, with the purpose of automatically selecting a set of representative sentences from a spoken document to concisely express the most important theme of the document, has been an active area of research and experimentation. On the other hand, word embedding has emerged as a newly fa...
A method for Automatic Text Summarization using Consensus of Multiple Similarity Measures and Ranking Techniques
In the era of information overload, text summarization can be defined as the process of extracting useful information from a large space of available content using traditional filtering methods. One of the major challenges in the domain of extraction based summarization is that a single statistical measure is not sufficient to produce efficient summaries which would be close to human-made ‘gold...
Summarization Evaluation: Correlating Human Performance on an Extrinsic Task with Automatic Intrinsic Metrics
Title of dissertation: Text Summarization Evaluation: Correlating Human Performance on an Extrinsic Task with Automatic Intrinsic Metrics. Stacy F. Hobson, Doctor of Philosophy, 2007. Dissertation directed by: Professor Bonnie J. Dorr, Department of Computer Science. Text summarization evaluation is the process of assessing the quality of an individual summary produced by human or automatic methods....
A new sentence similarity measure and sentence based extractive technique for automatic text summarization
The technology of automatic document summarization is maturing and may provide a solution to the information overload problem. Nowadays, document summarization plays an important role in information retrieval. With a large volume of documents, presenting the user with a summary of each document greatly facilitates the task of finding the desired documents. Document summarization is a process of...